Balls into Bins: strict Capacities and Edge Weights

Author

  • Ankur Sahai
Abstract

We explore a novel theoretical model for studying the performance of distributed storage management systems in which the data centers have limited capacities (compared to the storage space requested by users). Prior schemes such as balls-into-bins (used for load balancing) consider neither bin (consumer) capacities (multiple balls in a bin) nor the future performance of the system after balls (producer requests) are allocated to bins, and they restrict the number of balls as a function of the number of bins. Our problem consists of finding an optimal assignment of online producer requests to consumers (via weighted edges) in a complete bipartite graph while ensuring that the total size of the requests assigned to a consumer is limited by its capacity. The performance metric in this model is the weighted sum, to be minimized, of the requests assigned on the edges (loads) and their corresponding weights. We first explore optimal offline algorithms and then analyze different online techniques by comparing their performance against the optimal offline solution. LP and Primal-Dual algorithms are used to calculate the optimal offline solution in O(r · n) time (where r and n are the number of requests and consumers respectively), while randomized algorithms are used for the online case. We propose randomized online algorithms in which consumers are selected based on edge probabilities (which can change when consumers fail due to capacity exhaustion) and evaluate the performance of these randomized schemes using probabilistic analysis. The performance of the online algorithms is measured using competitive analysis, assuming an oblivious adversary who knows the randomized algorithm but not the results it produces.
For the simplified model with equal consumer capacities, an average-case competitive ratio (which compares the average cost of the output produced by the online algorithm with the minimum cost of the optimal offline solution) of $\frac{\overline{d}}{\min_{i,j} d_{i,j}}$ (where $d$ is the edge weight / distance) is achieved using an algorithm that selects any of the available edges with equal probability and runs in O(r) time. On extending the model to arbitrary consumer capacities, we show an average-case competitive ratio of $\frac{\overline{d \cdot c}}{\overline{c} \cdot \min_{i,j} d_{i,j}}$. This theoretical model gives a (storage) cloud system designer insight into how the different attributes (producer requests, edge weights and consumer capacities) affect the overall (read / write) performance of a distributed storage management system over a period of time.
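
The equal-probability selection rule described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `assign_uniform`, the `(producer, size)` request format, and the choice to leave a request unassigned when no consumer has room are all assumptions made for illustration. The sketch also rescans all consumers for every request, so it runs in O(r · n) time rather than the O(r) stated in the abstract.

```python
import random

def assign_uniform(requests, capacities, weights, seed=None):
    """Assign each request to a consumer picked uniformly at random
    among the consumers that still have enough capacity.

    requests   -- list of (producer_index, size) pairs, in arrival order
    capacities -- capacities[j] is the capacity of consumer j
    weights    -- weights[i][j] is the edge weight between producer i
                  and consumer j (complete bipartite graph)
    Returns (assignment, cost); assignment[k] is the chosen consumer for
    request k, or None if no consumer could hold it (an assumption of
    this sketch, not something the abstract specifies).
    """
    rng = random.Random(seed)
    remaining = list(capacities)   # remaining capacity per consumer
    assignment = []
    cost = 0.0
    for producer, size in requests:
        # Consumers whose remaining capacity can still hold this request.
        available = [j for j, cap in enumerate(remaining) if cap >= size]
        if not available:
            assignment.append(None)
            continue
        j = rng.choice(available)  # equal probability for each available edge
        remaining[j] -= size
        assignment.append(j)
        cost += size * weights[producer][j]  # load times edge weight
    return assignment, cost


if __name__ == "__main__":
    weights = [[1.0, 4.0], [2.0, 3.0]]          # hypothetical edge weights
    requests = [(0, 1), (1, 1), (0, 1), (1, 1)]  # hypothetical unit requests
    assignment, cost = assign_uniform(requests, [2, 2], weights, seed=0)
    print(assignment, cost)
```

Dividing the cost averaged over many seeds by the cost of an optimal offline assignment gives an empirical estimate of the kind of average-case ratio the abstract analyzes.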

Similar resources

On Weighted Balls-into-Bins Games

We consider the well-known problem of randomly allocating m balls into n bins. We investigate various properties of single-choice games as well as multiple-choice games in the context of weighted balls. We are particularly interested in questions that are concerned with the distribution of ball weights, and the order in which balls are allocated. Do any of these parameters influence the maximum...

Fractional Matching Via Balls-and-Bins

In this paper we relate the problem of finding structures related to perfect matchings in bipartite graphs to a stochastic process similar to throwing balls into bins. Given a bipartite graph with n nodes on each side, we view each node on the left as having balls that it can throw into nodes on the right (bins) to which it is adjacent. If each node on the left throws exactly one ball and each ...

Balanced offline allocation of weighted balls into bins

We propose a sorting-based greedy algorithm called SortedGreedy[m] for approximately solving the offline version of the d-choice weighted balls-into-bins problem where the number of choices for each ball is equal to the number of bins. We assume the ball weights to be nonnegative. We compare the performance of the sorting-based algorithm with a naïve algorithm called Greedy[m]. We show that by ...

Perfectly Balanced Allocation With Estimated Average Using Expected Constant Retries

Balanced allocation of online balls-into-bins has long been an active area of research for efficient load balancing and hashing applications. There exists a large number of results in this domain for different settings, such as parallel allocations [1], multi-dimensional allocations [5], weighted balls [4] etc. For sequential multi-choice allocation, where m balls are thrown into n bins with ea...

Balanced Allocations: Balls-into-Bins Revisited and Chains-into-Bins

The study of balls-into-bins games or occupancy problems has a long history since these processes can be used to translate realistic problems into mathematical ones in a natural way. In general, the goal of a balls-into-bins game is to allocate a set of independent objects (tasks, jobs, balls) to a set of resources (servers, bins, urns) and, thereby, to minimize the maximum load. In this paper ...

Journal title:
  • CoRR

Volume: abs/1206.3634

Publication date: 2012